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METHOD, SYSTEM, AND PRODUCT FOR ALLEVIATING ROUTER 

CONGESTION 

BACKGROUND OF THE INVENTION 

5 

1. Technical Field: 

The present invention relates generally to the field 
of computer systems and, more specifically to computer 
systems including a method, system, and apparatus for 
10 alleviating router congestion when the router is 

processing packets transmitted by computer systems having 
a congestion notification capability. 

2. Description of Related Art: 

15 When TCP/IP is used for data transmission between 

computer systems, the transmitted data is routed through 
intermediate routers . These routers may experience 
congestion. When the router is severely congested, the 
router will drop packets indiscriminately in order to 

20 reduce the congestion and to indicate to the sending 
computer systems that the router is congested. 

When the router is only moderately congested, a 
method of congestion notification has been proposed which 
requires the marking of packets to indicate congestion to 

25 the sending computer systems. An addition to the TCP/IP 
protocol, called Explicit Congestion Notification (ECN) , 
has been proposed as a method of indicating congestion to 
the sending computer systems. The ECN describes a method 
of marking packets in order to provide an indication of 

30 moderate congestion prior to the router actually dropping 
packets. Each packet includes an ECN bit and a 
Congestion Experienced (CE) bit in the IP header of the 
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packet. The ECN bit may be set by the sender to indicate 
that the sender and receiver of this packet have the ECN 
capabilities. The CE bit is set by the router through 
which a packet passed when the router is experiencing 
5 moderate congestion. 

When a router which utilizes the ECN protocol is 
moderately congested, the router will mark and pass 
packets having the ECN bit set, and will drop packets 
which do not have the ECN bit set. Therefore, packets 

10 transmitted by senders which have the ECN capability 
receive preference over packets transmitted by senders 
which do not have the ECN capability. 

When senders are notified that a router has become 
moderately congested, the senders should either cease or 

15 slow transmissions in order to relieve the congestion. 
If a sender without the ECN capability continues to 
transmit packets, these packets will be dropped. If a 
sender having the ECN capability continues to transmit 
packets, its packets will not be dropped until the router 

20 becomes severely congested. Therefore, it is possible 
for senders having the ECN capability to abuse the ECN 
protocol by continuing to transmit packets even after 
being notified that the router is moderately congested. 
Therefore, a need exists for a system, method, and 

25 product for reducing preferential treatment given to 
packets transmitted by computer systems having a 
congestion notification capability. 
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SUMMARY OF THE INVENTION 

The present invention is a method, system, and 
product for alleviating router congestion when the router 
5 is processing packets transmitted by computer systems 
having a congestion notification capability. Routers 
receive packets transmitted by senders which are capable 
of receiving a notification that the router is moderately 
congested. The routers also receive packets from senders 

10 which do not have a capability of receiving such a 
notification. 

When a router is moderately congested, the router 
will mark packets which were transmitted by senders which 
have the congestion notification capability. The marking 

15 indicates that the router is moderately congested. The 
receiving computer system then receives the packet and is 
thus notified that the router is congested. The 
receiving computer system will respond to the received 
packet by transmitting a packet, such as an 

20 acknowledgment, back to the sending computer system. 

This acknowledgment packet will similarly be marked by 
the router when the acknowledgment packet is forwarded by 
the router. When the sending computer system receives 
this acknowledgment packet which has been marked by the 

25 router, the sending computer system is thus notified that 
the router is moderately congested. The sending computer 
system then should take an action to reduce the router's 
congestion, such as by stopping or postponing 
transmitting packets utilizing this router. If the 

30 sending computer system continues to transmit packets 

utilizing the router after the sender has been notified 
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that the router is congested, the router will alleviate 
its congestion by dropping these packets transmitted by a 
sender which has been notified that the router is 
congested, 

5 Thereafter, the router will monitor the packets that 

it receives to determine whether any of these packets 
were transmitted by senders which have the congestion 
notification capability and which have already been 
notified that the router is moderately congested. If the 

10 router receives packets from senders which have this 
capability but have ignored the notification and have 
continued to transmit packets despite the notification, 
the router will drop these packets until it reaches a 
state where congestion clears, 

15 Packets transmitted by senders having congestion 

notification but which have ignored the notification are 
dropped. Packets from senders which have the congestion 
notification but which have not yet been notified that 
the router is congested will continue to be forwarded. 

20 Packets from senders which do not have the congestion 
notification capability are dropped. 

In order for the router to determine whether a sender 
having the congestion notification capability has been 
notified that the router is moderately congested, the 

25 router maintains a listing of identifiers. Each 

identifier identifies a TCP connection between a sending 
computer system and a receiving computer system. 

The listing of unique identifiers identify TCP 
session connections. Each TCP session connection is 

30 uniquely identified by a unique identifier. Each packet 
includes this unique identifier. 
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When the router receives a packet from a computer 
system having the congestion notification capability, the 
router will first check its list of identifiers to 
determine whether the identifier which identifies this 
5 computer system's connection is listed. If the router 
does not find the identifier in the list which identifies 
the connection between this sending computer system and a 
receiving computer system, the router will mark the 
packet, store a copy of the identifier in the list, and 
10 store the current time in the list along with the 
identifier . 

If the router does find the identifier in the list, 
the router will retrieve the time stored in the list with 
the identifier. The router then determines a round trip 

15 time which is the time required for a packet to be 

transmitted from the sending computer to the receiving 
computer and back to the sending computer. The router 
then calculates a transmission time which is the round 
trip time added to the time stored in the listing. 

20 When the current time is greater than the 

transmission time, the means that enough time has passed 
for the sending computer system to receive an 
acknowledgment packet from the receiving computer system 
which has been marked by the router and thus have been 

25 notified that the router is moderately congested. 
Therefore, if the current time is greater than the 
transmission time, the router will drop the packet because 
the sending computer system is presumed to have been 
notified that the router is moderately congested. 

30 When the current time is less than the transmission 

time, the means that not enough time has passed for the 
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sending computer system to receive an acknowledgment 
packet from the receiving computer system which has been 
marked by the router. In this case the sending computer 
system could not have been notified that the router is 
5 moderately congested. Therefore, if the current time is 
less than the transmission time, the router will forward 
the packet because the sending computer system has not 
been notified that the router is congested. 

When a router goes back to the normal non-congested 
10 state, the list of identifiers is deleted. The router will 
start building a new list when moderate congestion occurs 
again. 

The above as well as additional objectives, 
features, and advantages of the present invention will 
15 become apparent in the following detailed written 
description. 
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BRIEF DESCRIPTION OP THE DRAWINGS 

The novel features believed characteristic of the 
5 invention are set forth in the appended claims. The 

invention itself, however, as well as a preferred mode of 
use, further objectives and advantages thereof, will best 
be understood by reference to the following detailed 
description of an illustrative embodiment when read in 
10 conjunction with the accompanying drawings, wherein: 

Figure 1 depicts a block diagram of a data 
processing system in which the present invention may be 
implemented; 

Figure 2 illustrates a block diagram of a computer 
15 system which may be utilized as a server computer system 
in accordance with the present inventions- 
Figure 3 depicts a block diagram of a computer 
system which may be utilized as a client computer system 
in accordance with the present invention; and 
20 Figure 4 is a high level flow chart which depicts a 

router selectively discarding packets in order to 
alleviate router congestion when the router is processing 
packets transmitted by senders which have a congestion 
notification capability in accordance with the present 
25 invention. 
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DETAILED DESCRIPTION OF THE PREFERRED EMBODIiyiENT 

A preferred embodiment of the present invention and 
its advantages are better understood by referring to the 
5 figures, like numerals being used for like and 
corresponding parts of the accompanying figures. 

The invention is preferably realized using a 
well-known computing platform, such as an IBM RS/6000 
server running the IBM AIX operating system. However, it 

10 may be realized in any computer system platforms, such as 
an IBM personal computer running the Microsoft Windows 
operating system or a Sun Microsystems workstation 
running operating systems such as UNIX or LINUX or a 
router system from Cisco or Juniper, without departing 

15 from the spirit and scope of the invention. 

With reference now to the figures, Figure 1 depicts a 
pictorial representation of a network of data processing 
systems in which the present invention may be implemented. 
Network data processing system 100 is a network of 

20 computers in which the present invention may be 

implemented. Network data processing system 100 contains 
a network 102, which is the medium used to provide 
communications links between various devices and computers 
connected together within network data processing system 

25 100. Network 102 may include connections, such as wire, 
wireless communication links, or fiber optic cables. 

In the depicted example, a server 104 is connected to 
network 102 along with storage unit 106. In addition, 
clients 108, 110, and 112 also are connected to network 

30 102. These clients 108, 110, and 112 may be, for example, 
personal computers or network computers. In the depicted 
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example, server 104 provides data, such as boot files, 
operating system images, and applications to clients 
108-112. Clients 108, 110, and 112 are clients to server 
104. Network data processing system 100 may include 
5 additional servers, clients, and other devices not shown. 
In the depicted example, network data processing system 
100 is the Internet with network 102 representing a 
worldwide collection of networks and gateways that use the 
TCP/IP suite of protocols to communicate with one another. 

10 At the heart of the Internet is a backbone of high-speed 
data communication lines between major nodes or host 
computers, consisting of thousands of commercial, 
government, educational and other computer systems that 
route data and messages. Of course, network data 

15 processing system 100 also may be implemented as a number 
of different types of networks, such as for example, an 
intranet, a local area network (LAN) , or a wide area 
network (WAN) . Figure 1 is intended as an example, and not 
as an architectural limitation for the present invention. 

20 Referring to Figure 2, a block diagram of a data 

processing system that may be implemented as a server, 
such as server 104 in Figure 1, is depicted in accordance 
with a preferred embodiment of the present invention. 
Data processing system 200 may be a symmetric 

25 multiprocessor (SMP) system including a plurality of 
processors 202 and 204 connected to system bus 206. 
Alternatively, a single processor system may be employed. 
Also connected to system bus 206 is memory 

controller/cache 208, which provides an interface to local 
30 memory 209. I/O bus bridge 210 is connected to system bus 
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206 and provides an interface to I/O bus 212. Memory 
controller/cache 208 and I/O bus bridge 210 may be 
integrated as depicted. 

Peripheral component interconnect (PCI) bus bridge 
5 214 connected to I/O bus 212 provides an interface to PCI 
local bus 216. A number of modems may be connected to PCI 
bus 216. Typical PCI bus implementations will support 
four PCI expansion slots or add-in connectors. 
Communications links to network computers 108-112 in 
10 Figrure 1 may be provided through modem 218 and network 

adapter 220 connected to PCI local bus 216 through add-in 
boards . 

Additional PCI bus bridges 222 and 224 provide 
interfaces for additional PCI buses 226 and 228, from 

15 which additional modems or network adapters may be 

supported. In this manner, data processing system 200 
allows connections to multiple network computers, A 
memory-mapped graphics adapter 230 and hard disk 232 may 
also be connected to I/O bus 212 as depicted, either 

20 directly or indirectly. 

Those of ordinary skill in the art will appreciate 
that the hardware depicted in Figure 2 may vary. For 
example, other peripheral devices, such as optical disk 
drives and the like, also may be used in addition to or in 

25 place of the hardware depicted. The depicted example is 
not meant to imply architectural limitations with respect 
to the present invention. 

The data processing system depicted in Figure 2 may 
be, for example, an IBM RISC /System 6000 system, a product 

30 of International Business Machines Corporation in Armonk, 
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New York, running the Advanced Interactive Executive (AIX) 
operating system. 

With reference now to Figure 3, a block diagram 
illustrating a data processing system is depicted in which 
5 the present invention may be implemented. Data processing 
system 300 is an example of a client computer. Data 
processing system 300 employs a peripheral component 
interconnect (PCI) local bus architecture. Although the 
depicted example employs a PCI bus, other bus 

10 architectures such as Accelerated Graphics Port (AGP) and 
Industry Standard Architecture (ISA) may be used. 
Processor 302 and main memory 304 are connected to PCI 
local bus 306 through PCI bridge 308. PCI bridge 308 also 
may include an integrated memory controller and cache 

15 memory for processor 302. Additional connections to PCI 
local bus 306 may be made through direct component 
interconnection or through add- in boards. In the depicted 
example, local area network (LAN) adapter 310, SCSI host 
bus adapter 312, and expansion bus interface 314 are 

20 connected to PCI local bus 306 by direct component 

connection. In contrast, audio adapter 316, graphics 
adapter 318, and audio/video adapter 319 are connected to 
PCI local bus 306 by add-in boards inserted into expansion 
slots. Expansion bus interface 314 provides a connection 

25 for a keyboard and mouse adapter 320, modem 322, and 

additional memory 324. Small computer system interface 
(SCSI) host bus adapter 312 provides a connection for hard 
disk drive 326, tape drive 328, and CD-ROM drive 330. 
Typical PCI local bus implementations will support three 

30 or four PCI expansion slots or add- in connectors. 
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An operating system runs on processor 302 and is used 
to coordinate and provide control of various components 
within data processing system 300 in Figure 3. The 
operating system may be a commercially available operating 
5 system, such as Windows 2000, which is available from 
Microsoft Corporation. An object oriented programming 
system such as Java may run in conjunction with the 
operating system and provide calls to the operating system 
from Java programs or applications executing on data 

10 processing system 300. '"Java" is a trademark of Sun 

Microsystems, Inc. Instructions for the operating system, 
the object-oriented operating system, and applications or 
programs are located on storage devices, such as hard disk 
drive 326, and may be loaded into main memory 304 for 

15 execution by processor 302. 

Those of ordinary skill in the art will appreciate 
that the hardware in Figure 3 may vary depending on the 
implementation. Other internal hardware or peripheral 
devices, such as flash ROM (or equivalent nonvolatile 

20 memory) or optical disk drives and the like, may be used 
in addition to or in place of the hardware depicted in 
Figure 3. Also, the processes of the present invention 
may be applied to a multiprocessor data processing 
system. 

25 As another example, data processing system 300 may 

be a stand-alone system configured to be bootable without 
relying on some type of network communication interface, 
whether or not data processing system 300 comprises some 
type of network communication interface. As a further 

30 example, data processing system 300 may be a Personal 

Digital Assistant (PDA) device, which is configured with 
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ROM and/or flash ROM in order to provide non-volatile 
memory for storing operating system files and/or 
user-generated data. 

The depicted example in Figure 3 and above-described 
5 examples are not meant to imply architectural 

limitations. For example, data processing system 300 
also may be a notebook computer or hand held computer in 
addition to taking the form of a PDA. Data processing 
system 300 also may be a kiosk or a Web appliance. 

10 Figure 4 is a high level flow chart which depicts a 

router selectively discarding packets from senders which 
continue to transmit utilizing moderately congested 
routers despite being notified that the router is 
congested in accordance with the present invention. The 

15 process starts as depicted by block 400 and thereafter 
passes to block 402 which illustrates a router receiving 
a packet. Next, block 404 depicts a determination of 
whether or not the router is moderately congested. If a 
determination is made that the router is not moderately 

20 congested, the process passes to block 406 which 

illustrates the router forwarding the packet and clearing 
a list of unique identifiers. The process then passes to 
block 402. 

Referring again to block 404, if a determination is 
25 made that the router is moderately congested, the process 
passes to block 408 which depicts a determination of 
whether or not the packet was transmitted by a sender 
having a capability of receiving congestion 
notifications, such as ECN. If a determination is made 
30 that this sender does not have a congestion notification 
capability, the process passes to block 410 which 
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illustrates the router dropping the packet. The process 
then passes back to block 402. 

Referring again to block 408, if a determination is 
made that this sender does have a congestion notification 
5 capability, the process passes to block 412 which depicts 
the router getting the unique identifier which identifies 
the TCP session connection to which this packet belongs. 
Each TCP session will have an associated unique 
identifier which uniquely identifies the session. Every 

10 packet will have information the identifies its 

associated TCP session. Next, block 414 illustrates the 
router searching its listing of unique identifiers. 
Thereafter, block 416 depicts a determination of whether 
or not the unique identifier which identifies this 

15 packet's TCP session is stored in the list. If a 

determination is made that the unique identifier which 
identifies this packet's TCP session was not found in the 
list, the process passes to block 418 which depicts the 
router storing the unique identifier which identifies 

20 this packet's TCP session in the listing. The current 
time (t) is also stored in the listing along with the 
unique identifier for this packet's session. The process 
then passes to block 420 which illustrates the router 
marking the packet according to congestion notification 

25 protocols, such as ECN, as having passed through a 

moderately congested router. This marked packet is then 
forwarded to its intended receiver. The process then 
passes back to block 402. 

Referring again to block 416, if a determination is 

30 made that the unique identifier which identifies this 
packet's session was found in the listing, the process 



15 



Docket No. AUS920000942US1 

passes to block 422 which illustrates the router 
retrieving the time (t) stored in the listing with the 
unique identifier which identifies this packet's session. 
Next, block 424 depicts the router calculating a 
5 transmission time. The transmission time is the time 
stored with this listing plus the estimated round trip 
time for a packet. Next, block 426 depicts a 
determination of whether or not the current time is 
greater than the transmission time. If a determination 

10 is made that the current time is greater than the 

transmission time, the process passes back to block 410. 
Referring again to block 426, if a determination is made 
that the current time is not greater than the 
transmission time, the process passes to block 420. 

15 It is important to note that while the present 

invention has been described in the context of a fully 
functioning data processing system, those of ordinary 
skill in the art will appreciate that the processes of 
the present invention are capable of being distributed in 

20 the form of a computer readable medium of instructions 
and a variety of forms and that the present invention 
applies equally regardless of the particular type of 
signal bearing media actually used to carry out the 
distribution. Examples of computer readable media 

25 include recordable- type media such a floppy disc, a hard 
disk drive, a RAM, and CD-ROMs and transmission- type 
media such as digital and analog communications links. 

The description of the present invention has been 
presented for purposes of illustration and description, 

30 but is not intended to be exhaustive or limited to the 
invention in the form disclosed. Many modifications and 
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variations will be apparent to those of ordinary skill in 
the art. The embodiment was chosen and described in 
order to best explain the principles of the invention, 
the practical application, and to enable others of 
5 ordinary skill in the art to understand the invention for 
various embodiments with various modifications as are 
suited to the particular use contemplated. 



